Niger Delta


Learning to Interpret Weight Differences in Language Models

Goel, Avichal, Kim, Yoon, Shavit, Nir, Wang, Tony T.

arXiv.org Artificial Intelligence

Finetuning (pretrained) language models is a standard approach for updating their internal parametric knowledge and specializing them to new tasks and domains. However, the corresponding model weight changes ("weight diffs") are not generally interpretable. While inspecting the finetuning dataset can give a sense of how the model might have changed, these datasets are often not publicly available or are too large to work with directly. Towards the goal of comprehensively understanding weight diffs in natural language, we introduce Diff Interpretation Tuning (DIT), a method that trains models to describe their own finetuning-induced modifications. Our approach uses synthetic, labeled weight diffs to train a DIT-adapter, which can be applied to a compatible finetuned model to make it describe how it has changed. We demonstrate in two proof-of-concept settings (reporting hidden behaviors and summarizing finetuned knowledge) that our method enables models to describe their finetuning-induced modifications using accurate natural language descriptions.
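
For intuition about the objects involved, here is a minimal PyTorch/transformers sketch of computing a weight diff between a base and a finetuned checkpoint and then prompting the (hypothetically DIT-adapted) model to describe itself. The model IDs, the adapter call, and the prompt are illustrative placeholders, not the paper's actual artifacts.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

base = AutoModelForCausalLM.from_pretrained("base-model")            # placeholder model ID
finetuned = AutoModelForCausalLM.from_pretrained("finetuned-model")  # placeholder model ID

# The "weight diff" is just the parameter-wise difference between checkpoints.
base_sd, ft_sd = base.state_dict(), finetuned.state_dict()
weight_diff = {name: ft_sd[name] - base_sd[name] for name in base_sd}

# A DIT adapter (trained on synthetic, labeled weight diffs) would be loaded
# into the finetuned model, which is then simply asked to describe itself:
# finetuned.load_adapter("dit-adapter")  # hypothetical adapter checkpoint
tok = AutoTokenizer.from_pretrained("base-model")
inputs = tok("Describe how finetuning changed your behavior.", return_tensors="pt")
out = finetuned.generate(**inputs, max_new_tokens=64)
print(tok.decode(out[0], skip_special_tokens=True))
```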


Real-time, Adaptive Radiological Anomaly Detection and Isotope Identification Using Non-negative Matrix Factorization

Jones, Chandler, Bandstra, Mark, Faaland, Stefan, Lai, Yue Shi, Abgrall, Nico, Suchyta, Scott, Cooper, Reynold

arXiv.org Artificial Intelligence

Spectroscopic anomaly detection and isotope identification algorithms are integral components in nuclear nonproliferation applications such as search operations. The task is especially challenging for mobile detector systems because the observed gamma-ray background varies far more than it does for a static detector, so a pretrained background model can easily find itself out of domain. As a result, algorithms may exceed their intended false alarm rate, or sacrifice detection sensitivity in order to maintain it. Non-negative matrix factorization (NMF) has been shown to be a powerful tool for spectral anomaly detection and identification, but, like many algorithms that rely on data-driven background models, its conventional implementation cannot update in real time to account for environmental changes that affect the background spectroscopic signature. We have developed a novel NMF-based algorithm that periodically updates its background model to accommodate changing environmental conditions. The Adaptive NMF algorithm involves fewer assumptions about its environment, making it more generalizable than existing NMF-based methods while maintaining or exceeding detection performance on simulated and real-world datasets.
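
For intuition, here is a minimal sketch of the general pattern (not the paper's Adaptive NMF algorithm): fit an NMF background basis, score incoming spectra by reconstruction error, and periodically refit on recent benign data. All spectrum sizes, component counts, and thresholds below are illustrative.

```python
import numpy as np
from sklearn.decomposition import NMF

rng = np.random.default_rng(0)
background = rng.poisson(lam=50.0, size=(200, 128)).astype(float)  # 200 spectra x 128 bins

def fit_background(spectra):
    model = NMF(n_components=4, init="nndsvda", max_iter=500)
    model.fit(spectra)
    return model

def anomaly_score(spectrum, model):
    """Reconstruction error of a spectrum under the learned background basis."""
    w = model.transform(spectrum.reshape(1, -1))
    return float(np.linalg.norm(spectrum - w @ model.components_))

model = fit_background(background)
threshold = np.percentile([anomaly_score(s, model) for s in background], 99)

# Streaming loop: score each new spectrum, and periodically refit the
# background model on recent benign data to track environmental changes.
buffer = list(background)
for t in range(500):
    spectrum = rng.poisson(lam=50.0, size=128).astype(float)
    if anomaly_score(spectrum, model) > threshold:
        print(f"t={t}: possible anomaly")
    else:
        buffer.append(spectrum)              # only benign spectra update the model
    if t % 100 == 99:                        # periodic background update
        model = fit_background(np.array(buffer[-200:]))
        threshold = np.percentile([anomaly_score(s, model) for s in buffer[-200:]], 99)
```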


Ev2R: Evaluating Evidence Retrieval in Automated Fact-Checking

Akhtar, Mubashara, Schlichtkrull, Michael, Vlachos, Andreas

arXiv.org Artificial Intelligence

Current automated fact-checking (AFC) approaches commonly evaluate evidence either implicitly via the predicted verdicts or by comparing retrieved evidence with a predefined closed knowledge source, such as Wikipedia. However, these methods suffer from limitations, resulting from their reliance on evaluation metrics developed for different purposes and constraints imposed by closed knowledge sources. Recent advances in natural language generation (NLG) evaluation offer new possibilities for evidence assessment. In this work, we introduce Ev2R, an evaluation framework for AFC that comprises three types of approaches for evidence evaluation: reference-based, proxy-reference, and reference-less. We evaluate their effectiveness through agreement with human ratings and adversarial tests, and demonstrate that prompt-based scorers, particularly those leveraging LLMs and reference evidence, outperform traditional evaluation approaches.
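
A reference-based, prompt-based scorer of the kind the framework evaluates might look roughly like the sketch below. The `call_llm` stub, the prompt wording, and the 0-5 scale are illustrative assumptions, not Ev2R's actual templates.

```python
# Sketch of a reference-based evidence scorer: an LLM judges how well
# retrieved evidence covers the reference evidence for a claim.
def call_llm(prompt: str) -> str:
    raise NotImplementedError("plug in your LLM client here")

def score_evidence(claim: str, retrieved: list[str], reference: list[str]) -> float:
    prompt = (
        "You are evaluating evidence retrieved for fact-checking a claim.\n"
        f"Claim: {claim}\n"
        "Reference evidence:\n- " + "\n- ".join(reference) + "\n"
        "Retrieved evidence:\n- " + "\n- ".join(retrieved) + "\n"
        "On a scale of 0-5, how well does the retrieved evidence cover the "
        "facts in the reference evidence? Answer with a single number."
    )
    return float(call_llm(prompt).strip())
```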


Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG

Jin, Bowen, Yoon, Jinsung, Han, Jiawei, Arik, Sercan O.

arXiv.org Artificial Intelligence

Retrieval-augmented generation (RAG) empowers large language models (LLMs) to utilize external knowledge sources. The increasing capacity of LLMs to process longer input sequences opens up avenues for providing more retrieved information, potentially enhancing the quality of generated outputs. It is plausible to assume that a larger retrieval set would contain more relevant information (higher recall) and thus yield better performance. However, our empirical findings demonstrate that for many long-context LLMs, the quality of generated output first improves but then declines as the number of retrieved passages increases. This paper investigates this phenomenon, identifying the detrimental impact of retrieved "hard negatives" as a key contributor. To mitigate this and enhance the robustness of long-context LLM-based RAG, we propose both training-free and training-based approaches. We first showcase the effectiveness of retrieval reordering as a simple yet powerful training-free optimization. Furthermore, we explore training-based methods, specifically RAG-specific implicit LLM fine-tuning and RAG-oriented fine-tuning with intermediate reasoning, demonstrating their capacity for substantial performance gains. Finally, we conduct a systematic analysis of design choices for these training-based methods, including data distribution, retriever selection, and training context length.
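
Retrieval reordering of the "most relevant at the edges" variety can be sketched in a few lines: push strong passages toward the start and end of the context, where long-context LLMs attend most, and bury weak ones in the middle. This interleaving scheme is an assumption for illustration and may differ from the paper's exact procedure.

```python
def reorder(passages: list[str]) -> list[str]:
    """`passages` is assumed sorted from most to least relevant."""
    front, back = [], []
    for i, p in enumerate(passages):
        (front if i % 2 == 0 else back).append(p)
    return front + back[::-1]

ranked = ["p1 (best)", "p2", "p3", "p4", "p5 (worst)"]
print(reorder(ranked))  # ['p1 (best)', 'p3', 'p5 (worst)', 'p4', 'p2']
```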


Could AI save Nigerians from devastating floods?

Al Jazeera

In the small village of Ogba-Ojibo in central Nigeria, sitting at the confluence of two of the nation's largest rivers – the Niger and Benue – 27-year-old Ako Prince Omali is counting the steps carved out of the dirt, which lead down the loam-coloured banks of the river Niger. This river bank, dotted with tufts of spiky grass, is where villagers come to fish or wash produce and laundry. Just last week, three of the steps were submerged during one night of rain, which raised the water level by about five metres. Normally, you can count seven steps down into the river. Now, only four remain above the surface of the water, the sticks bracing the muddy steps having washed away in the deluge.


NaijaHate: Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data

Tonneau, Manuel, de Castro, Pedro Vitor Quinta, Lasri, Karim, Farouq, Ibrahim, Subramanian, Lakshminarayanan, Orozco-Olvera, Victor, Fraiberger, Samuel P.

arXiv.org Artificial Intelligence

To address the global issue of online hate, hate speech detection (HSD) systems are typically developed on datasets from the United States and thereby fail to generalize to English dialects from the Majority World. Furthermore, HSD models are often evaluated on non-representative samples, raising concerns about overestimating model performance in real-world settings. In this work, we introduce NaijaHate, the first HSD-annotated dataset containing a representative sample of Nigerian tweets. We demonstrate that HSD evaluated on the biased datasets traditionally used in the literature consistently overestimates real-world performance by at least two-fold. We then propose NaijaXLM-T, a pretrained model tailored to the Nigerian Twitter context, and establish the key role played by domain-adaptive pretraining and finetuning in maximizing HSD performance. Finally, owing to the modest performance of HSD systems in real-world conditions, we find that content moderators would need to review about ten thousand Nigerian tweets flagged as hateful every day to moderate 60% of all hateful content, highlighting the challenge of moderating hate speech at scale as social media usage continues to grow globally. Taken together, these results pave the way towards robust HSD systems and better protection of social media users from hateful content in low-resource settings.
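
The workload claim can be sanity-checked with back-of-the-envelope arithmetic: at realistic hate prevalence, even a decent classifier's review queue dwarfs the hateful content it actually catches. Every number below is an illustrative assumption chosen to reproduce the order of magnitude, not a figure from the paper.

```python
daily_tweets = 1_000_000      # assumed daily Nigerian tweet volume
prevalence = 0.002            # assumed fraction of tweets that are hateful
target_recall = 0.60          # share of hateful content to moderate
precision_at_recall = 0.12    # assumed classifier precision at that recall

hateful = daily_tweets * prevalence          # 2,000 hateful tweets per day
caught = hateful * target_recall             # 1,200 must be caught
flagged = caught / precision_at_recall       # 10,000 tweets to review daily
print(f"Review queue: {flagged:,.0f} tweets/day for {target_recall:.0%} coverage")
```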


Which Nigerian-Pidgin does Generative AI speak?: Issues about Representativeness and Bias for Multilingual and Low Resource Languages

Adelani, David Ifeoluwa, Doğruöz, A. Seza, Shode, Iyanuoluwa, Aremu, Anuoluwapo

arXiv.org Artificial Intelligence

Naija is the Nigerian Pidgin spoken by approximately 120 million speakers in Nigeria; it is a mixed language drawing on English, Portuguese, and Indigenous languages. Although it has mainly been a spoken language until recently, there are currently two written genres of Naija (BBC and Wikipedia). Through statistical analyses and machine translation experiments, we show that these two genres do not represent each other (i.e., there are linguistic differences in word order and vocabulary) and that Generative AI operates only on Naija written in the BBC genre. In other words, Naija written in the Wikipedia genre is not represented in Generative AI.
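
One simple statistical comparison of the kind described can be sketched on toy data: measure vocabulary overlap between the two genres. The sentences and the metric below are illustrative, not the paper's corpora or tests.

```python
# Vocabulary Jaccard overlap between two small toy corpora; low overlap
# would indicate the kind of genre divergence the paper reports.
def vocab(corpus: list[str]) -> set[str]:
    return {tok for sent in corpus for tok in sent.lower().split()}

bbc = ["dem don close di road", "wahala dey for town"]  # toy BBC-genre sentences
wiki = ["naija na kontri for west africa"]              # toy Wikipedia-genre sentence

v_bbc, v_wiki = vocab(bbc), vocab(wiki)
jaccard = len(v_bbc & v_wiki) / len(v_bbc | v_wiki)
print(f"vocabulary Jaccard overlap: {jaccard:.2f}")
```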


Time Series Forecasting of HIV/AIDS in the Philippines Using Deep Learning: Does COVID-19 Epidemic Matter?

Aribe, Sales G. Jr., Gerardo, Bobby D., Medina, Ruji P.

arXiv.org Artificial Intelligence

With a 676% growth in HIV incidence between 2010 and 2021, the HIV/AIDS epidemic in the Philippines is the fastest-spreading in the western Pacific. Although the full effects of COVID-19 on HIV services and development are still unknown, it is predicted that such disruptions could lead to a significant increase in HIV casualties. The nation therefore needs modeling and forecasting techniques to foresee the spread pattern and enhance the government's prevention, treatment, testing, and care programs. In this study, the researchers use a Multilayer Perceptron neural network to forecast the time series spanning the period when the COVID-19 pandemic struck the nation, using statistics taken from the HIV/AIDS and ART Registry of the Philippines. After training, validation, and testing, the study finds that predicted cumulative cases will reach 145,273 by 2030. Additionally, there is very little difference between observed and predicted HIV epidemic levels, as evidenced by low RMSE, MAE, and MAPE values as well as a high coefficient of determination. Further analysis suggests that the Philippines is far from achieving Sustainable Development Goal 3 of Project 2030, owing to the rising rate of new HIV infections in the nation. Despite the detrimental effects of the spread of COVID-19 on HIV/AIDS efforts nationwide, the Philippine government, under the Marcos administration, must continue to pursue the United Nations 90-90-90 targets by enhancing its ART program and ensuring that all vital health services are readily accessible and available.
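
The underlying technique, a multilayer perceptron regressing the next value from a lagged window of the series, can be sketched on synthetic data as follows. The window length, network size, and error metrics are illustrative, not the study's configuration.

```python
import numpy as np
from sklearn.neural_network import MLPRegressor

# Toy cumulative-case series standing in for registry data.
series = np.cumsum(np.random.default_rng(1).poisson(30, size=240)).astype(float)

LAGS = 12  # each input is the previous 12 observations
X = np.array([series[i:i + LAGS] for i in range(len(series) - LAGS)])
y = series[LAGS:]

split = int(0.8 * len(X))  # chronological train/test split
mlp = MLPRegressor(hidden_layer_sizes=(32, 16), max_iter=2000, random_state=0)
mlp.fit(X[:split], y[:split])

pred = mlp.predict(X[split:])
rmse = np.sqrt(np.mean((pred - y[split:]) ** 2))
mape = np.mean(np.abs((pred - y[split:]) / y[split:])) * 100
print(f"RMSE={rmse:.1f}  MAPE={mape:.2f}%")
```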


Language Model Decoding as Direct Metrics Optimization

Ji, Haozhe, Ke, Pei, Wang, Hongning, Huang, Minlie

arXiv.org Artificial Intelligence

Despite the remarkable advances in language modeling, current mainstream decoding methods still struggle to generate texts that align with human texts across different aspects. In particular, sampling-based methods produce less-repetitive texts that are often disjointed in discourse, while search-based methods maintain topic coherence at the cost of increased repetition. Overall, these methods fall short of achieving holistic alignment across a broad range of aspects. In this work, we frame decoding from a language model as an optimization problem, with the goal of strictly matching the expected performance of human texts as measured by multiple metrics of desired aspects simultaneously. The resulting decoding distribution enjoys an analytical solution that scales the input language model distribution via a sequence-level energy function defined by these metrics. Most importantly, we prove that this induced distribution is guaranteed to improve perplexity on human texts, suggesting a better approximation to the underlying distribution of human texts. To facilitate tractable sampling from this globally normalized distribution, we adopt the Sampling-Importance-Resampling technique. Experiments across various domains and model scales demonstrate the superiority of our method over strong baselines, in both metric alignment with human texts and human evaluation.
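
Sampling-Importance-Resampling over an energy-reweighted LM distribution can be sketched generically: draw proposals from the base model, weight each by exp(-energy), and resample. The toy repetition-based energy and the base sampler below are stand-ins, not the paper's learned metrics.

```python
import math, random

def energy(text: str) -> float:
    """Illustrative energy: penalize repetition (one 'metric of a desired aspect')."""
    toks = text.split()
    rep = 1.0 - len(set(toks)) / max(len(toks), 1)
    return 5.0 * rep  # higher energy = more repetitive = less likely after reweighting

def sir_decode(sample_fn, n_proposals: int = 64) -> str:
    """Draw proposals from the base LM, reweight by exp(-energy), resample one.

    Since proposals come from the LM itself, the importance weight for the
    target distribution p(x) * exp(-E(x)) reduces to exp(-E(x)).
    """
    proposals = [sample_fn() for _ in range(n_proposals)]
    weights = [math.exp(-energy(p)) for p in proposals]
    return random.choices(proposals, weights=weights, k=1)[0]

# Usage with any sampler, e.g.: sir_decode(lambda: my_lm.sample(prompt))
```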


Comment: how ships can outwit piracy with AI

#artificialintelligence

Deep learning is on the frontline in a new age of piracy, outwitting attacks with pre-emptive tech, explains Yarden Gross, CEO and co-founder of Orca AI. Almost a decade has passed since piracy raged off Somalia, and yet the danger posed by maritime hijackings is as present as ever. The global pandemic last year sparked a resurgence of attacks, with piracy incidents doubling across Asia, in a worrying uptick also seen in the Gulf of Mexico and West Africa. The fallout from coronavirus, including the loss of key security personnel, turned quarantined vessels into easy targets. This wave has since receded a little, with the International Maritime Bureau reporting a 44 per cent YoY dip in piracy and armed robbery incidents in 2021.